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Ultimate periodicity of b-recognisable sets 
a quasilinear procedure 
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Abstract 



It is decidable if a set of numbers, whose representation in a base b 
is a regular language, is ultimately periodic. This was established by 
Honkala in 1986. 

We give here a structural description of minimal automata that 
Q . accept an ultimately periodic set of numbers. We then show that it 

can verified in linear time if a given minimal automaton meets this 
description. 

This yields a 0(nlog(n)) procedure for deciding whether a general 
, deterministic automaton accepts an ultimately periodic set of num- 

\^ ' bers. 
(N ■ 
T— I , 

O ■ 1 Introduction 

m 

Given a fixed positive integer b, called the base, every positive integer n is rep- 
resented (in base b) by a word over the canonical digit alphabet Ab = {0,1, ... ,b—l} 
which does not start with a 0. Hence sets of numbers are represented by lan- 
guages of A^*. Depending on the base, a given set of integers may be repre- 
sented by a simple or complex language: the set of powers of 2 is represented 
by the rational language 10* in base 2, and, in base 3, by a context-sensitive 
language, much harder to describe. 

A set of numbers is said to be b-recognisableii it is represented by a recog- 
nisable, or rational, or regular, language over A^ . On the other hand, a set of 
numbers is recognisable if it is, via the identification of N with a* {n -H- a"). 
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a recognisable, or rational, or regular, language of the free monoid a*. A 
set of numbers is recognisable if, and only if it is ultimately periodic (UP) 
and we use the latter terminology in the sequel as it is both meaningful and 
more distinguishable from 6-recognisable. It is common knowledge that ev- 
ery UP-set of numbers is 6-recognisable for every b, and the above example 
shows that the converse is not true. It is an exercice to show that if b and c 
are multiplicatively dependent (that is, there exist integers k and / such that 
b'' = c'), then every 6- recognisable set is a c- recognisable set as well (c/. [9] 
for instance). A converse of these two properties is the theorem of Cobham: 
a set of numbers which is both b- and c-recognisable, for multiplicatively in- 
dependent b and c, is UP, estabhshed in 1969 [5], a strong and deep result 
whose proof is difficult. 

After Cobham's theorem, the next natural (and last) question left open 
on 6-recognisable set of numbers was the decidability of ultimate periodicity. 
It was positively solved in 1986: 

Theorem 1 (Honkala [TT]). It is decidable whether an automaton over Ab* 
accepts an UP-set of numbers. 

The complexity of the decision procedure was not an issue in the original 
work. Neither were the properties or the structure of automata accepting UP- 
set of numbers: Given an automaton A over A^, bounds on the parameters 
of a potential UP-set of numbers if it were accepted by A are computed. 
The property is then decidable as it is possible to enumerate all automata 
that accept sets with smaller parameters and check whether any of them is 
equivalent to A. 

As explained below, subsequent works on automata and number represen- 
tation brought some answers to the question of the complexity of the decision 
procedure, explicitely or implicitely. The present paper addresses specifically 
this problem and yields the following statement. 

Theorem 2. It is decidable in linear time whether a minimal DFA A over A^ 
accepts an UP-set of numbers. 

As it is often the case, this complexity result is the consequence of a struc- 
tural characterisation. Indeed, we describe here a set of structural properties 
for an automaton: the shape of its strongly connected components (SCO's) 
and the one of its graph of SCO's, that we gather under the name of UP- 
criterion. Theorem [2] then splits into two results: 

Theorem 3. A minimal DFA A over Af^ accepts a UP-set of numbers if, 
and only if, it satisfies the UP-criterion. 
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Theorem 4. It is decidable in linear time whether a minimal DFA A over Ab 
satisfies the UP-criterion. 

As for Cobham theorem {of. [HE]), new insights on the problem tackled 
here are obtained when stating it in a higher dimensional space. Let N*^ be 
the additive monoid of d-tuples of integers. Every d-tnp\e of integers may 
be represented in base 6 by a (i-tuple of words of Ai,* of the same length 
as shorter words can be padded by O's without changing the corresponding 
value. Such d-tuples can be read by (finite) automata over (Ab)* — automata 
reading on d synchronised tapes — and a subset of N'^ is 6-recognisable if the 
set of the ^-representations of its elements is accepted by such an automaton. 

On the other hand, recognisable and rational sets of are defined in the 
classical way but they do not coincide as N'^ is not a free monoid anymore. 
A subset of N'^ is recognisable if is saturated by a congruence of finite index, 
and the family of recognisable sets is denoted by RecN'^. A subset of N'^ 
is rational if is denoted by a rational expression, and the family of rational 
sets is denoted by Rat M"^. Rational sets of N"^ have been characterised by 
Ginsburg and Spanier as sets definable in the Preshurger arithmetic ( N, + ) 
([To]), hence the name Preshurger definable that is most often used in the 
literature. 

It is also common knowledge that every rational set of N"^ is 6- recognisable 
for every 6, and the example in dimension 1 is enough to show that the 
converse is not true. The generalisation of Cobham's theorem: a subset ofW^ 
which is both b- and c-recognisable, for multiplicatively independent b and c, 
is rational, is due to Semenov (c/. [HE]). The generalisation of Honkala's 
theorem went as smoothly. 

Theorem 5 (Muchnik [U]). It is decidable whether a b-recognisable subset 
o/N*^ is rational. 

Theorem 6 (Leroux [H]). It is decidable in polynomial time whether a 
minimal DFA A over (Aif)* accepts a rational subset ofN'^. 

The algorithm underlying Theorem [5] is triply exponential whereas the one 
described in [H], based on sophisticated geometric constructions, is quadratic 
— an impressive improvement — but not easy to explain. 

There exists another way to devise a proof for Honkala's Theorem which 
yields another extension. In [10], Ginsburg et Spanier proved that being an 
UP-set of numbers (indeed, in higher dimension, being a recognisable subset 
of N'^) is expressible in Presburger arithmetic. In [2], it was then noted that 
since addition in base p is realised by a finite automaton, every Presburger 
formula is realised by a finite automaton as well. Hence a decision procedure 
that establishes Theorem [H 
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Generalisation of base p by non standard numeration systems then gives 
an extension of Theorem [H best expressed in terms of abstract numeration 
systems. Given a totally ordered alphabet A, any rational language L of A* 
defines an abstract numeration system (ANS) Sl in which the integer n is 
represented by the rtfl-th word of L in the radix ordering of A* {of. Il2j)- A 
set of integers whose representations in the ANS Sl form a rational language 
is called iSi-recognisable and it is known that every UP-set of numbers is Sl- 
recognisable for every ANS Sl ([13]). The next statement then follows. 

Theorem 7. // Sl is an abstract numeration system in which addition is 
realised by a finite automaton, then it is decidable whether a SL-recognisable 
set of numbers is UP. 

For instance, Theorem [7| implies that ultimately periodicity is decidable 
for sets of numbers represented by rational sets in a Pisot base system [S]. 
The algorithm underlying Theorem [7] is exponential (if the set of numbers is 
given by a DFA) and thus (much) less efficient than Leroux's constructions for 
integer base systems. On the other hand, it applies to a much larger family 
of numeration systems. All this was mentioned for the sake of completeness, 
and the present paper does not follow this pattern. 

Theorem [6l restricted to dimension 1, readily yields a quadratic proce- 
dure for Honkala's theorem. The improvement from quadratic to quasilinear 
complexity achieved in this article is not a natural simplification of Leroux's 
construction for the case of dimension 1. Although the UP-criterion bears 
similarities with some of Leroux's features, it is not derived from [H], and 
nor is the proof of quasilinear complexity. 

The paper is organised as follows: 

In Section [21 we treat the special case of determining whether a given 
minimal group automaton accepts an ultimately periodic set of numbers. 
We describe canonical automata, which we call Pascal automata, that accept 
such sets. We then show how to decide in linear time whether a given minimal 
group automaton is the quotient of some Pascal automaton. 

Section [3] introduces the UP-criterium and sketches both its complete- 
ness and correction. An automaton satisfying the UP-criterium is a directed 
acyclic graph (DAG) 'ending' with at most two layers of non-trivial strongly 
connected components (SCO's). If the root is seen at the top, the upper 
(non-trivial) SOO's are circuits of O's and the lower ones are quotients of 
Pascal automata. It is easy, and of linear complexity to verify that an au- 
tomaton has this overall structure. This criterium is sketched in Figure [H 
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(0-circuits) (Pascal) 




2 The Pascal automaton 



2.1 Preliminaries 
2.1.1 On automata 

In the following, a deterministic finite automaton is denoted by A= {Q, A,S,i,T) , 
where Q is a set of states; A is a set of letters, called the alphabet; S: 
Q X A ^ Q is the transition function; i is a particular state in Q, called 
the initial state; T is a subset of Q, called the set of final states. 

As usual, 6 is extended to a function Q x A* — > Q hy S{q,e) — q 
and 5{q,ua) — 5{5{q,u),a); and S{q,a) will also be denoted by g • a, for 
short. 

A word u of A* is said to be accepted by .4. if i • -u is in T. The language 
(i.e. the set of words) accepted by A is called the behaviour of A, denoted 
by . A langage is said to be rational (or regular) if it is the behaviour of 
some automaton. 

If we ignore the labels, the initial and final states, an automaton is a 
directed graph, called underlying graph. 

With any rational language L is associated a minimal automaton. It is the 
deterministic automaton accepting L with the smallest number of states This 
automaton is unique up to isomorphism and can be computed in 0{nlog{n)) 
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time, given an automaton with n states accepting L [cf. [T] for instance). 

Given a deterministic automaton A, every word u induces an 
application {q — )■ q ■ u) over the state set. These apphcations form a fi- 
nite monoid, called the transition monoid of A. When this monoid happens 
to be a group (meaning that the action of every letter is a permutation over 
the states), A is called a group automaton. 

2.1.2 On numbers 

The base b is fixed throughout the paper (it will be a parameter of the al- 
gorithms, not an input) and so is the digit alphabet Ab. As a consequence, 
the number of transitions of any automaton over Ab is linear in its num- 
ber of states. Verifying that an automaton is determinitic (resp. a group 
automaton) can then be done in linear time. 

For our purpose, it is far more convenient to write the integers least 
significant digits first (LSDF), and to keep the automata reading from left 
to right. The value of a word u = aoai---an of Ab , denoted by u, is 
then u = Y17=oi^i^^) ^le obtained by the recursive formula: 



Conversely, every integer n has a unique canonical representation in base b 
that does not end with 0, denoted by (n). A word of Ab has value n if, and 
only if, it is of the form (n) O'^. 

By abuse of language, we may talk about the set of numbers accepted by 
an automaton. An integer n is then accepted if there exists a word of value n 
accepted by the automaton. 

A set £^ C N is periodic, of period p, if there exists R C {0, 1, . . . ,p — l} 
such that i? = {nGN|3rG-R n = r [p]}. A periodic set E has a smallest 
period, which is the period of E and the corresponding R is the set of residues 
of E: the set E is then denoted by E^. The set of numbers in E^ and larger 
than an integer m is denoted by E^ ,^. 

2.2 Definition of a Pascal automaton 

We begin with the construction of an automaton that accepts the set E^, 
in the case where 



ua = u + ab^ 



u 




p is coprime with b. 
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We call any such automaton a Pascal automaton\j If p is coprime with 6, 
there exists a (smallest) integer ip such that: 

b^ = l[p] and thus VxgN 6" = 6" '""'^ ^ [p] . 

Therefore, from Equation ([1]), knowing m mod p and |m| mod ip is enough to 
compute ua mod p. 

Hence the definition of = {Z/pZx Z/ipZ, Ab, r], (0, 0), x Z/^Z ) , where 

V(s,t) G Z/pZ X Z/ipZ, \fa e r7((s,t),a) = {s,t) ■ a = {s + ab\t + l) 

(2) 

By induction on it follows that (0,0) ■ u = [u mod p,\u\ mod ■?/') for 
every u in A;,* and \Vp\ = . 

Example 8. Fig.\E shows Pf, the Pascal automaton accepting integers writ- 
ten in binary and congruent to 2 modulo 3. For clarity, the labels are omitted; 
transitions labelled with 1 are drawn with thick lines and those labelled with 
with thin lines. 




Figure 2: The Pascal automaton P| 



In fact, this construction does not require that p and R be canonical. For 
arbitrary p (still prime with b) and R, we call the automaton constructed in 
this way a generalised Pascal automaton and denote it by Qp. 

2.3 Recognition of quotients of Pascal automata 

The tricky part of achieving a linear complexity for Theorem H] is contained 
in the following statement: 

Theorem 9. It is decidable in linear time whether a minimal DFA A over Al 
is the quotient of a Pascal automaton. 

^ As early as 1654, Pascal describes a computing process that generalises the casting 
out nines and that determines if an integer n, written in any base b, is divisible by an 
integer p (see [HI Prologue]). 
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Simplifications Since is a group automaton, all its quotients are group 
automata. 

The permutation on Z/pZ x X/i/jX realised by O^'^"^-* is the inverse of the 
one realised by and we call it the action of the "digit" 0~^. Let g he a new 
letter whose action on Z/pZ x Z/ipZ is the one of 10~^. It follows from ([2]) 
that for every a in — where a is understood both as a digit and as a 
number — the action of a on Z/pZ x Z/ipZ (in Vp) is equal to the one 
of g'^0. The same relation holds in any group automaton A over Ab* that is 
a quotient of a Pascal automaton, and this condition is tested in linear time. 

Let B = {0, g} be a new alphabet. Every group automaton A= {Q, Af,, 6, i, 
which is potentially a quotient of a Pascal automaton may be transformed 
into an automaton A' = {Q,B,6',i,T) where, for every q in Q, S'{q,0) = 
5{q,0) and 5'{q, g) = d{q,10-^) . 

Fig.[3] shows P'3 where transitions labelled with are drawn with thin 
lines and those labelled with g with double linesH 



T) 




Figure 3: The modified Pascal automaton V'^ 



Analysis: computation of the parameters From now on, and for the 

rest of the section, A= {Q, A^,, S,i,T) is a group automaton which has been 
consistently transformed into an automaton A' = {Q, A,6' ,i,T). If A' is a 
quotient of a Pascal automaton Vp, then the parameters p and R may be 
computed (or 'read') in A'; this is the consequence of the following statement. 

Proposition 10. Let (f: Vp A! he a morphism. Then, for every {x,y) 
and {x',y') in Z/pZ x Z,/ipZ,, if ip{x,y) = ip{x' ,y') , then y ^ y' . 

Proof. Ab absurdo. Since {x,y) > (0,0), (p{x,y) = (p{x',y) im- 

■p/R 

P 

plies ip{0, 0) = ip{z, 0) with z = x' — x. Such z may be chosen minimal, 
and the image by of the (/-circuit containing (0, 0) in Vp (of length p) is 
a ^f-circuit in A' of length z. 

^The transformation makes even clearer that the transition monoid of Vp (and thus 
of Vp) is the semi-direct product Z/pZ x: Z/^Z. 
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Since the image, and the inverse image, by a quotient morphism of a final 
state is final, if r is in R, that is, if (r, 0) is final, so is (r + z,0), and the 
pair p, R is not canonical. □ 



Corollary 11. If A' — {Q,A,S',i,T) is a quotient of a modified Pascal 
automaton V^, then p is the length of the g-circuit in A' which contains i 
and R — {r\i-g'^^T}. 

Next, if A! is a quotient of a (modified) Pascal automaton V'^, the equiv- 
alence class of the initial state of V'^ may be 'read' as well in A' as the 
intersection of the 0-circuit and the (/-circuit around the initial state of A! . 

More precisely, and since (0, 0) ^ (s, 0) {s, t) , the following holds. 

'p/R T>fR 
p P 

Proposition 12. Let ip: V'p A' be a morphism. Then (f{s,t) — <^(0, 0) 
if and only if i ■ g^ = i ■ 0~* . 

From this proposition follows that, given A' ^ it is easy to compute the class 
of (0, 0) modulo if if A! is indeed a quotient of a (modified) Pascal automaton 
by !~p. Starting from i, one first marks the states on the ^f-circuit C. Then, 
starting from i again, one follows the 0~^-transitions: the first time C is 
crossed yields t. This parameter is characteristic of </?, as explained now. 

Let {s,t) be an element of the scmidirect product Gp = Z/pZ x Ij/ipZ 
and T(^s,t) the permutation on Gp induced by the multiplication on the left 
by {s,ty. 

T(s,t)iix,y)) = {s,t){x,y) = {xb* + s,y + t) . (3) 
The same element (s, t) defines a permutation o'(^s,t) on Z/pZ as well: 

Vx e Z/pZ (T{s,t){^) = x6* + s . (4) 

Proposition 13. Let ip: V'p A! be a morphism and let {s,t) be the 
state ip-equivalent to (0, 0) with the smallest second component. Then, every 
equivalence class modulo (/? is an orbit of T(^s,t) o-i^d R is an union of orbits 
ofcr^s,t)- 

Proof. Since </? is a morphism and T(^s,t) multiplies on the left by (s, t) (which 
is (/^-equivalent to (0,0)), it follows that T(^s,t) is stable on every (/^-equivalence 
class. This already proves that i? is a union of orbits of o'(^s,t)- 

Moreover, one can reduce the case where there are two elements h and h' 
in the same (/^-equivalence class but in a different r(s,t)-orbit, to the case where 
one of them is (0, 0), by multiplying both by on the right. 
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We denote by C the (/^-equivalence class of (0,0). The action of T(^s,t) on 
the second component is simply to add t. It follows that there cannot be two 
vertices and {x2,y2) in C such that \yi — ^2! < t, since applying T(s,t) 

enough time to both would yield a vertex in C whose second component is 
smaller than t. 

For all j, tJ^^ ^^(0, 0) is equal to {zj,jt) for some Zj, and is in C (since r^-'^ 
is stable over C). There cannot be any other state {x, y) in C, since otherwise 
there would exist some j such that jt ^ y < {j + l)t, and then \y — jt\ < t. 

Hence C is the orbit of (0,0) for r^s^t)- CH 



Synthesis: verification that a given automaton is a quotient of a 
Pascal automaton Given A" = {Q, A,S',i,T), let p, R and (s,t) com- 
puted as explained above. It is easily checked that R is an union of orbits 
of (7(^s,t) and that \\Q\\ = pt. The last step is the verification that A' is indeed 
(isomorphic to) the quotient of V'p by the morphism ip defined by {s,t). 

A corollary of Proposition [13] (and of the multiplication law in Gp) is 
that every class modulo ip contains one, and exactly one, element whose 
second component is smaller than t. From this observation follows that the 
multiplication by the generators = (0,1) and g = (1,0) in the quotient 
of V'p by ip may be described on the set of representatives 

Q^ = {{x,z)\x e z/pZ, z e z/tz} 

(beware that z is in Z/tZ and not in Z/ipZ) by the following formulas: 



V(x, z) G 

{x,z)-0 = ix,z) (0,1) 



{x,z + l) ii z<t-l 

r,\Jx,z + l) = {^,0) if z = t-l 



'(s,t)V-^? ^ ^ — V fet 
{x,z)-g = {x,z){l,0) = {x + b',z) . 

Hence A' is the quotient of V'p by ip if one can mark Q according to these 
rules, starting from i with the mark (0,0), without conflicts and in such a 
way that two distinct states have distincts marks. Such a marking is realised 
by a simple traversal of A', thus in linear time, and this concludes the proof 
of Theorem [91 

Remark 14. Theoreml^ states that one can decide in linear time whether a 
given automaton A is a quotient of a Pascal automaton, and in particular A 
has a fixed initial state that plays a crucial role in the verification process. 

The following proposition shows that the property (being a quotient of a 
Pascal automaton) is actually independant of the state chosen to be the initial 
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one. If it holds for A, it also holds for any automaton derived from A by 
changing the initial state. This is a general property that will be used in the 
general verification process described in the next section. 

Proposition 15. // an automaton A = {Q,A,6,i,T) is the quotient ofVp, 
then for every state q in Q, Aq={Q^ A,6,q,T) is the quotient of V^' for 
some set R' . 

Proof. Since the morphism associated with a quotient does not depend on 
the initial state, it is enough to prove that changing the initial state of a 
Pascal automaton yield another Pascal automaton with the same period. 

It is then easy to verify that, if the new initial state is (s,t), the new 
automaton is equal to where S = | r G R}, the state {x,y) of Vp 
corresponding to the state (s + xb*, y + t) of Vp. □ 

3 The UP-criterion 

Let A = {Q, A, E, I,T) be an automaton, a the strong connectivity equiv- 
alence relation on Q, and 7 the surjective map from Q onto Q/a. The 
condensation Cj^ of A is the morphic image of A by 7. The condensation 
of A is computed in linear time by Tarjan's algorithm (c/. [B]). 

Definition 16 (The UP-criterion). Let A be a deterministic automaton 
on A}^ and C_4 its condensation. 

UP-0 The successor by of a final (resp. non- final) state of A is final (resp. 
non- final). 

UP-1 Every SCC stable by a non-0 digit is a leaf o/C^ (called Type 1 SCC). 

UP-2 Every non trivial SCC which is not of Type 1 is a simple 0-circuit 
and has a unique Type 1 SCC as successor in C_a; it is called a Type 2 
SCC. 

UP-3 Every Type 1 SCC is the quotient of a Pascal automaton Vp for 
some R and p. 

UP-4 Let C be a Type 2 SCC and D the Type 1 SCC that is its successor 
in C^. Then, 7~^(CUL') is a covering of-y~^{D), apart from final states. 

We call UP-automaton an automaton that satisfies the UP-criterion. 
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The schematic representation of the UP-criterion at Fig.[T]allows to review 
items 1 to 4. There are two levels of SCO's in the condensation; squares and 
ovals. Squares are the Type 1 SCO's, leaf of C ((UP-1)). Each of them is the 
quotient of a Pascal automaton and as such are complete ((UP-3)). Ovals are 
the Type 2 SCO; each of them has for unique successeur a square ((UP-2)) 
and 'behaves in the same manner' as a circuit of O's from this square (dotted 
lines): that is, every vertex of a Type 2 SOO is associated with a vertex of 
a 0-circuit of the Type 1 SOO and two associated vertices have the same 
behaviour: their respective successor by are associated and they have the 
same successor by a non-0 digit ((UP-4)). 

Example 17. shows a complete but simple example of an automaton 

satisfying the UP-criterion. 

The framed subautomata are the minimisation of Pascal automata V^^'^^ 
on the top and pj^'^'^'^^ on the bottom. The two others non-trivial SCO's are 
respectively {-82,^2} (^iT'd {-D2} o^^e reduced to 0-circuits. Each of them has 
successors in only one Pascal automaton. 

The dotted lines highlight the covering: the circuits {B,C) and (i?2,C2) 
behave similarly: B is associated with B2 and C with C2. The successors 
by of B and B2 are respectively C and C2 which are associated, and their 
successor by 1 is the same, namely A. A similar observation can be done 
for C and C2 • 



Q 




Figure 4: A complete example of the UP-criterion 



Completeness and correctness of the UP-criterion are established as fol- 
lows. 

1. Every UP-set of numbers is accepted by an UP-automaton. 
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2. The UP-criterion is stable by quotient; 



3. Every UP-automaton accepts an UP-set of numbers. 

The two first steps insure completeness for minimal automata (as every b- 
recognisable set of numbers is accepted by a unique minimal automaton ), the 
third one plays for correctness. 

3.1 Every UP-set of numbers is accepted by a UP- 
automaton 

Proposition 18. For every integers m and p and for every set R of residues 
there exists an UP-automaton accepting E^^. 

3.1.1 When the period divides a power of the base 

Let Ep be a periodic set of numbers such that p \ V for some j. An automaton 
accepting E^ is obtained by a generalisation of the method for recognising 
if an integer written in base 10 is a multiple of 5, namely checking if its unit 
digit is or 5: from ([T]) follows: 

Lemma 19. Let d he an integer such that d \ V (and dj(b^~^) and u in A}^ of 
length j. Then, w in A^* is such that W = u[d] if, and only if, w = uv for a 
certain v. 

Figure O shows an example of such a construction. 



Figure 5: Automaton accepting integers congruent to 1 modulo 4 en base 2 



3.1.2 The case of periodic sets of numbers 

Let Ep be a periodic set of numbers. In contrast with Sect. 12. 21 p and b 
are not supposed to be coprime anymore. Given a integer p, there exist k 
and d such that p = kd, k and b are coprime, and d \ V for a certain j. The 
Chinese remainder theorem, a simplified version of which is given below. 
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allows to break the condition: 'being congruent to r modulo p' into two 
simpler conditions. 

Theorem 20 (Chinese remainder theorem). Let k and d he two coprime 
integers. Let r/^, he two integers. There exists a unique integer r < kd 
such that r = rk[k] and r = rd[d]. 

Moreover, for every n such that n = rk[k] and n = rd[d], then n = r[kd]. 

Let us assume for now that i? is a singleton {r}, with r in {0, 1, . . . ,p — l} 
and define = (r mod d) and = (r mod k). The Chinese remainder 
theorem implies: 

G N n = r [p] <^=^ n = Vk [k] and n = Vd [d] . (5) 

The Pascal automaton V^'' accepts the integers satisfying n = [k] and 
an automaton accepting the integers satisfying n = [d] is described in 
Section 13.1.11 

Taking the product of the two automata yields an automaton that accepts 
the integers satisfying both equations of the right handside of and this is 
an UP-automaton. 

Example 21. The following figures show the construction of an automaton 
accepting the set of representations in hase 2 of the integers congruent to 5 
modulo 12. Fig.l^showsV^ , minimised for clarity, Fig^^shows an automaton 
accepting integers congruent to 1 modulo A, and Fig.\^ shows the product of 
the preceeding two, which accepts the required set of numhers. 




Figure 6: The minimisation of V23 

Let now R = {ri,r2, . . . ,re} be a subset of {0, 1, . . . ,p — l}. In order to 
build an automaton Bp that accepts E^, let S = {{ri ^, ri^k), {^"24, '^2,k)^ ■ ■ ■ , {^(.4^ '"^.fc)} 
be the set of pairs (r^ rf, fc) such that an integer n is congruent to rj modulo p 
if, and only if, both n = Vi^k [k] and n = ri^^L [d]- 

For every x < d, let = {vi^k \ x = Vi^d}, which means that ii n = x [d] 
then n is in if, and only if, it is congruent to some t in T^ modulo k. It 
may be the case that for some x, {k, T^) are not the canonical parameters 
for -EJ"". An automaton that accepts E^^ is thus written as a generalised 
Pascal automaton ^J"". 

The automaton consists then in a complete 6-tree of depth j, whose b' 
leaves are replaced by generalised Pascal automata. More precisely, the 
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Figure 7: Automaton accepting integers congruent to 5 modulo 12 in base 2 

word u of length j reaches the state q of G^^ where u = x [d] and q is defined 
by (0, 0) q. It is a routine to verify that the automaton constructed in 

such a way accepts and satisfies the UP-criterion. 

Example 22. Fig.l^ shows the construction of an automaton accepting the 
set of representations in base 3 of the integers congruent to 0, 2, A, 5, and 9 
modulo 18. The various parameters are: 

p = 18, k = 2, d = 9, J = 2; R = {0,2,4,5,9}; S = 
{(0,0), (2,0), (4,0), (4,1), (0,1)}; 

To = {0, 1} since both (0, 0) and (0, 1) are in S; 

T, = Ts = n = Tj = Ts = 0; T2 = = {0}; and T, = {!}. 




Figure 8: Automaton accepting n = 0, 2, 4, 5, 9 mod 18 in base 3 

3.1.3 The case of arbitrary UP-set of numbers 

Let us first denote by Dm the automaton accepting words whose value is 
greater than m. It consists in a complete 6-tree of depth \logb{m)] plus a 
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final sink state. Every state may be labelled by the value of the word reaching 
it and it is final if its label is greater than m. Additionaly, every leaf loops 
onto itself by reading a and reaches the sink state by reading any other 
digit. 

Example 23. Fig.l^ shows P5 (in base 2). 




Figure 9: Automaton accepting n ^ 5 in base 2 



Every Dm is obviously an UP-automaton. 

An arbitrary UP-set of numbers -E^^ is accepted by the product x V^, 
denoted by Bp ,^. The very special form of Vm makes it immediate that the 
product is an UP-automaton, and this complete the proof of Proposition [T8l 

Example 24. The following figures show the construction of the automa- 
ton B241 accepting non-negative integers congruent to modulo 24- 

Figure [771 shows the automaton T>i, Figure IW\ shows the automaton B24 
and Figure [HI shows their product, 1 . 

3.2 The UP-criterion is stable by quotient 

Proposition 25. If A is a UP-automaton, then every quotient of A is also 
a UP-automaton. 

The UP-criterion relies on properties of SCC's and that are stable by 
quotient. The proof of Proposition[2B]then consists essentially in proving that 
SCC's are maped into SSC's by the morphism. It is given in the appendix 
(pagelHj). 
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Figure 11: Vi Figure 12: Vi x B^^ = B\ 



Lemma 26. Let A and B be two deterministic finite automata, and (p a 
morphism defining a quotient A ^ B. 

Every SCC of B is the quotient by ip of a SCC of A. 

Proof. 1. If two vertex x and y are strongly connected in A, then <f{x) 
and (f{y) are strongly connected in B 

This is a direct consequence of the morphism. 

2. For every SCC C ofB, there exist a SCC C of A such that i^{C) = C . 
Let 5* its inverse image of C by 

(a) There exists a strongly connected set of states V contained in S , 
with no edge going from V to S\V. 

■^If the automaton is not deterministic, it would work as well, with the appropriate 
definition of morphism (see |16|). 



17 



One can order S with the reachabihty relation. Since 5* is finite, 
there is a minimal equivalency class. We denote this set of states 
by V. 

(b) (p(y) is equal to C . 

For all vertex x in S* and y' in C", x can reach (without leav- 
ing S) some vertex of ip~^{y'), a direct consequence of morphism. 
If X is taken in V, it means that y' is in ^p{V), for all y' in C, 
hence fiV) = C 

Let us denote by C the SCC containing V. Since f{C) is strongly 
connected (from (1)) and contains C, then ip{C) = C. 

3. For every SCC C of B, there exist a SCC C of A such that ip\c induces 
a morphism from C to C . 

We denote by V the set of SCC of A whose image by ip is C . Since V 
is not empty (from (b)), there exists a SCC C of F that cannot reach 
any other SCC of F. It follows that every internal transition of C is 
also internal in C, hence ip\c induces a morphism from C to C". 

□ 

Proof of Proposition Let ^ be a UP-automaton and B one of its quo- 
tients. The very definition of a quotient implies that B satifies (UP-0). 

Lemma 1261 forces every Type 1 (resp. Type 2) SCC of A to be the quotient 
by of a Type 1 (resp. Type 2) SCC of A. Proving that B satifies (UP-1) 
up to (UP-4), is then immediate. 

□ 

3.3 Every UP-automaton accepts an UP-set of num- 
bers 

Since a finite union of UP-sets of numbers is still UP, and since the paths of 
a finite DAG are the paths of a finite union of linear graphs, it is sufficient 
to establish the following proposition. 

Proposition 27. Every UP-automaton A whose condensation C^^ is linear 
accepts an UP-set of numbers. 

Proof. Without loss of generality (that is, up to a finite number of elements 
in 1^1), one can assume that all final states belong to the SCC's of A. More- 
over, if A has both a Type 1 and a Type 2 SCC, one can assume, by (UP-4) 
and up to the addition or subtraction of one element in |^|, that the Type 2 



18 



sec has the same final or non-final status as its image in the Type 1 SCC S 
of A, and then, by minimisation, that A has no Type 2 SCC. 

Let u be the shortest (and unique) word that sends the initial state i, the 
root of C_A, into S: i j. Let Sj be the automaton obtained from S by 

taking j as initial state. By (UP-3) (and Proposition [T^ Sj is a quotient of 
a Pascal automaton and accepts a periodic set of numbers E^. 

Every w in |^| is of the form w = uv with v in \Sj\. Hence w = u + 6'"'u 
and w in |^| if, and only if, W ^ U and w belongs to b^'^^Ep. Then, \A\ is an 
UP-set of numbers of period pfe'"'. □ 



4 Conclusion and future work 

This work almost closes the complexity question raised by the Honkala's 
original paper fTT]. The simplicity of the arguments in the proof should not 
hide that the difficulty was to make the proofs simple. Two questions remain: 
getting rid, in Theorem [2] of the minimality condition; or of the condition of 
determinism. 

We are rather optimistic for a positive answer to the first one. Since the 
minimization of a DFA whose SCC's are simple cycles can be done in linear 
time (cf [3]), it seems that the higher part of the UP-criterion (DAG and 
Type 2 SCC's) should not be hard to verify. It remains to find an algorithm 
deciding in linear time whether a given DFA has the same behaviour as a 
Pascal automaton. This is the subject of still ongoing work of the authors. 

On the other hand, defining a similar UP-criterion for nondeterministic 
automata seems much more difficult. The criterium relies on the form and 
relations between SCC's, and the determinisation process is prone to destroy 
them. 
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